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ESTIMATING THE DEGREE OF ACTIVITY OF 
JUMPS IN HIGH FREQUENCY DATA 

By Yacine Ait-Sahalia^ and Jean Jacob 

Princeton University and UPMC (Universite Paris-6) 

We define a generalized index of jump activity, propose estima- 
tors of that index for a discretely sampled process and derive the 
estimators' properties. These estimators are applicable despite the 
presence of Brownian volatility in the process, which makes it more 
challenging to infer the characteristics of the small, infinite activity 
jumps. When the method is applied to high frequency stock returns, 
we find evidence of infinitely active jumps in the data and estimate 
their index of activity. 

1. Introduction. Using high frequency financial data, which are now 
widely available, we can hope to answer a number of questions regarding 
the characteristics of the process that drives asset returns. Let us model 
the log-price X of some asset as a 1-dimensional process, which we will ob- 
serve over a fixed time interval [0, T] at discrete times 0, A„, 2A„, . . . with a 
time interval A„ between successive observations that is small. This is the 
essence of high frequency data. Let us further assume that this process is 
an Ito semimartingale, meaning that its characteristics are absolutely con- 
tinuous with respect to Lebesgue measure. So, it has a drift, a continuous 
martingale part that is the integral of a possibly stochastic process with 
respect to a Brownian motion, and we will also let it have jumps with a 
possibly stochastic Levy measure. 

For modeling purposes, one would like to infer the characteristics of X 
from the observations; that is, its drift, volatility and Levy measure. When 
the time interval A„ goes to 0, it is well known that one can consistently 
infer the volatility under very weak assumptions. However, such consistent 
inference is impossible for the drift or the Levy measure, if the overall time 
interval [0,T] is kept fixed. 
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In fact, even in the unrealistic case where the whole path of X is observed 
over [0,T], one can infer neither the drift nor the Levy measure. One can, 
however, hope to be able to characterize the behavior of the Levy measure 
near 0: first, whether it does not explode near 0, meaning that the number of 
jumps is finite; and second, when this number is infinite, we would like to be 
able to say something about the concentration of small jumps. Our objective 
in doing so is to provide specification tools for financial models, where the 
presence or at least possibility of large jumps is generally accepted. There is 
much less consensus in the literature regarding the nature or even the need 
for small jumps. 

For this purpose, let us define, for a generic semimartingale X, 

(1) B{r)t=Y.\AXs\'\ It = {r>0:B{r)t<oo}, A=mf(/t), 

s<t 

where AXg = Xg — Xg- is the size of the jump at time s, and r > 0, with 
the convention = 0. Necessarily, the (random) set It contains the interval 
{f3t, oo), whereas it may contain f3t itself or not. Moreover, 2 £ It always, 
and, of course, 1 1— > /3t is nondecr easing. Hence, if we observe the whole path 
of X over [0,T], we know the sets and the numbers (3t{'^) for all t <T. 

We call Pri^) the jump activity index for the path 1 1— > Xt{tL!) at time T 
(or, more precisely, up to time T). We define this index in analogy with the 
special case where X is a Levy process. In this case. It and (3t ai'e no longer 
random. Further, they do not depend on the time t, and It is also the set 
of all r > such that /||^|<i} \x\^F{dx) < oo, where F is the Levy measure. 
This property shows that, for a Levy process, the jump activity index coin- 
cides with the Blumenthal-Getoor index of the process [see Blumenthal and 
Getoor (1961)]. In the further special case where X is a stable process, P is 
also the stable index of the process. 

When X is a Levy process, the interval / and the index /3 are, of course, 
only tiny elements of the whole Levy measure F, which convey approxi- 
mately the same information (/ gives slightly more information than (3). 
However, the value of /3 is probably the most informative knowledge one 
can draw about F from the observation of the path t^Xt for all t <T 
when T is finite. Things are very different when T ^ oo, though, since ob- 
serving X over [0,oo) completely specifies F. But, when the time horizon T 
is kept fixed, and with the whole path observed over [0,T], we can infer only 
the behavior of the Levy measure F near (because we need a potentially 
infinite number of observations for consistent estimation). Then, /? captures 
an essential qualitative feature of F, which is its level of activity, which is 
that when /3 increases, the (small) jumps tend to become more and more 
frequent. 

(3 is related to the "degree of activity" of jumps. All Levy measures put 
finite mass on the set (— oo, —e] U [e, +oo) for any arbitrary e > 0; therefore. 
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if the process has infinite jump activity, then it must be because of the 
"small" jumps, which are defined as those smaller than e. If F{[—e,e]) < oo, 
then the process has finite activity and € /, or, equivalently, /? = 0. But, 
if F{[—e,e]) = oo, then the process has infinite activity, and, in addition, 
/? > as long as the Levy measure F{[—e,e]) diverges near at a rate larger 
than a power for some a > 0. The higher /3 gets (up to 2), the more 
active the small jumps become. The same remarks also apply for general 
semimartingales. These properties are what motivate our calling (3 a jump 
activity index and our interest in estimating it. 

In the more realistic situation where the semimartingale X is only ob- 
served at times iA„ over [0,T], the estimation problem is made more chal- 
lenging by the presence in X of a continuous martingale part. By its very 
nature, Pt characterizes the behavior of F near 0. Hence, it is natural to 
expect that the small increments of the process are going to be the ones that 
are most informative about Pt- But, those small increments are precisely 
the ones where the contribution from the continuous martingale part of the 
process is inexorably mixed with the contribution from the small jumps. 
Being able to "see through" the continuous part of the semimartingale in 
order to say something about the number and concentration of small jumps 
is going to be the challenge we face as we attempt to estimate Pt- 

Related to this paper are Woerner (2006), who proposes an estimator of 
the jump activity index and of the Hurst exponent, but in the absence of 
a continuous Brownian part to the semimartingale, and Cont and Mancini 
(2007), who propose a test for the finiteness of the variation of the jump 
part. Also, Belomestny (2008) estimates the same index when there is no 
Brownian part and when, together with the prices, some option prices also 
are recorded. Another related problem is the estimation of the index /? of 
a stable process [see, e.g., DuMouchel (1983)]. However, our situation here 
is fundamentally different from those, in that we also have a continuous 
part in the semimartingale. The situation is also different from that in Ai't- 
Sahalia and Jacod (2008), where we studied Fisher's information for the 
parameters of a Brownian plus stable pure jump process, but the jump 
process was the dominant component in that paper. Here, the continuous 
part of the semimartingale dominates the small increments, and we estimate 
the activity index of a pure jump process where the dominant component is 
a Brownian motion. 

The aim of this paper is to construct estimators Pn{T) for Pt, which are 
consistent when A„ — > 0, and to provide rates of convergence and asymptotic 
distributions. Ideally, we would also like to have estimators that ctrG, clS much 
as possible, model-free, in the sense that they behave well without too strong 
assumptions on the form of the drift, the volatility or the Levy measure. 

As it turns out, a fully model-free behavior of the estimators may be 
too much to ask. The assumptions we make below on the drift and the 
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volatility process are quite unrestrictive, but obtaining rates of convergence 
will require more specific assumptions on the Levy measure. In particular, 
we will assume that the main part of the Levy measure near behaves locally 
like the Levy measure of a stable process, and we will provide estimators 
and their properties when the index is (3 > 0. This assumption seems to be 
unavoidable, since, as we shall also see, even when X is a Levy process, 
strong assumptions on the Levy measure are necessary. At this juncture, it 
may be worth noting that considering semimartingales rather than simply 
Levy processes, or exponentials of such, does not change or weaken the 
results. 

The paper is organized as follows. In Section 2, we formally define the 
index of jump activity, construct estimators for it and present the main 
properties of the estimators in the general case where the process is a semi- 
martingale. Section 3 is devoted to the special and simpler case of a symmet- 
ric stable process, and Section 4 is about more general Levy processes. We 
propose a small sample bias correction in Section 5. We present the results 
of Monte Carlo simulations in Section 6, and we compute our estimators 
over all 2006 transactions of the Dow Jones stocks in Section 7, focusing in 
particular on Intel and Microsoft. Section 8 is devoted to technical results 
and to the proof of the main theorems, which apply to Ito semimartingales, 
under suitable assumptions on the Levy measure. 

2. The model and main results. 

2.1. Defining an index of jump activity. Our structural assumption is 
that X is a 1-dimensional Ito semimartingale on some filtered space 
(fi, J^, (.Ft)f>o,F), which means that its characteristics (i?,C, i^) are abso- 
lutely continuous with respect to Lebesgue measure [see Jacod and Shiryaev 
(2003) for all notions not explained here] . In other words, the characteristics 
of X have the form 



Here, b = (bt) and a = (at) are real-valued optional processes, and Ft = 
Ft{uj,dx) is a predictable random measure, meaning that for all Borel sets 
A in M the process {Ft{A)) is predictable (possibly taking the value +oo). 
This model is quite general. For instance, the drift, volatility and jump 
measures can be stochastic and jump themselves. 

There are other ways of expressing this assumption, for example through 
a Wiener process W and a Poisson random measure fi with compensator 
u{dt,dx) = dt® dx (up to a possible enlargement of the space), as 
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(3) + ^5(s,j;)l{|5(^^^)l<i}(^- 

In this formulation, b and a are the same as in (2), 5 = 6{uj,t,x) is a 
predictable function and the connection with Ft is that Ft{u;,dx) is the 
restriction to M \ {0} of the image of the Lebesgue measure by the map 
X 1-^ 5{uj,t,x). However, it is easier for the problem at hand to express the 
assumptions on Ft rather than on 6, which, moreover, is not unique (whereas 
Ft is uniquely defined, up to null sets). 

Below, for any measure on R we denote by H its (symmetrical) tail 
function 

(4) x>O^H{x) = H{[-x,xr). 

Observe that B{r)t < oo if and only if B'{r)t < oo, where B'{r)t = 
X]s<t |AXs|'' A 1, and the process B'{r) is finite-valued if and only if it is 
locally integrable. In other words, divergence, when it occurs, is caused by 
the small jumps. Moreover, for any stopping time T, we have 

E{B'{r)T)=K(^J^ ds J (|xr Al)Fs(dx)y 

We call instantaneous jump activity index at time t the (random) number 

(5) Pi=mf{r>0: f {\x\'' Al)Fs{dx) < oo] 



In light of (5), this is a natural generalization of the notion of Blumenthal- 
Getoor index for Levy processes [see Blumenthal and Getoor (1961)]. /?* is 
a predictable process taking its values in [0,2]. This process is also charac- 
terized by the property that, for any e > 0, we have 

(6) lim 3;^t+^Fj(2;) = 0, limsupx^*~^Ft(x) = oo. 

The "lim sup" above is usually not a limit. 

Note, finally, that PI = does not necessarily imply that the process has 
finite jump activity, since it is possible for the Levy measure to diverge 
slowly near 0, at a subgeometric speed. An example of this would be the 
Gamma process, which has F{dx) = {rie-x.-p{—Kx)l^x>o}/x)dx, so that the 
Levy measure Ft{e) diverges at a logarithmic rate in e. Finite activity pro- 
cesses (compound Poisson) will have [31 = Q a.s., however. 
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2.2. Assumptions. We make two assumptions. The first one, on the drift 
b and volatihty a, is quite mild. 

Assumption 1. The processes b and a are locally bounded. 

The second assumption, on the Levy measures Ft, is more specific. Es- 
sentially, we split Ft as Ft = Fl + F/', where: 

• -F/ is very close to the Levy measure of a /3-stable process, restricted to a 
random interval {—Zt \zt^^) around with some (3 that is not random; 
the random interval may be empty for some but not for all; 

• Fj.' is another Levy measure with jump activity index less than some 
/?'</?. 

The precise statement of the assumption is as follows. 

Assumption 2. There are three (nonrandom) numbers (3 G (0,2), /?' £ 
[0,/3) and 7 > 0, and a locally bounded process Lt>l, such that we have, 
for all {uj,t), 

(7) Ft = Fl + F[\ 

where: 

(a) Fj- has the form 



zl '<x<0}' 




for some predictable nonnegative processes al^\at \zt^'^ and zl ^ and some 
predictable function f{uj,t,x) satisfying 

l + kl/(t,x)>0, |/(t,x)|<Lt; 
(b) Fj-' is a measure that is singular with respect to F/ and satisfies 

(10) / {\xf M)Fi'{dx)<Lt. 

We will also need the increasing and locally bounded process 

(11) At= A, ds, where At = ' Z ' ■ 

Jo p 
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Remark 1. In view of (6), the instantaneous index at time t "due to" 
part F/ of the Levy measure is (3 on the set {At > 0}, and 0, otherwise; 
whereas, the one due to F/' is everywhere smaller than /?'. Hence, outside a 
null set, we have (3t = P on the set {At > 0} and (3t < P' , otherwise. 

Remark 2. One could formulate Assumption 2 slightly differently by 
writing Ft = + F^, where F^ is given by (8) with f{t, x) = and Ft sat- 
isfying (10) with some P" , and, further, the restriction of Ft to 
has an absolutely continuous part with a density of the form f{t,x)\x\'^~^~^ 
with (9). The two formulations are equivalent, provided that we take /?' = 
/3"v(/3-7). 

Remark 3. Take any process of the form 

(12) dXt = btdt + at dWt + <5t„ dYt + 5't_ dY^ , 

where 5 and 5' are cadlag adapted processes, Y is /3-stable or tempered 
/3-stable and Y' is any other Levy process whose Levy measure integrates 
\x\^ near the origin and has an absolutely continuous part whose density 
is smaller than K\x\'^~^~^ on [—1,1] for some 7 > (e.g., a stable process 
with index strictly smaller than /?'). Then, X will satisfy Assumption 2. 
For instance, when further Y is symmetrical with Levy density D/\x\^^^, 
it is satisfied with the two numbers f3 and /?', 7 (as above), f(t,x) = 0, 
= 4+) = land at^=4-^^=D5l. 

Remark 4. When X is a Levy process, so that Ft{io,dx) = F{dx) does 
not depend on t and Assumption 2 is related with the property that X is 
a "regular Levy process of exponential type," as introduced in Boyarchenko 
and Levendorskii (2002), with [3 = u and (3' f\{(3 — ^) = v' in the notation of 
that paper. These two assumptions are not exactly comparable. The one in 
Boyarchenko and Levendorskii (2002) is more stringent about the behavior 
of "big" jumps, whereas ours is slightly more demanding for "small" jumps. 

2.3. The estimators. Recall that we observe Xi^^ for i = 0, 1, . . . , [T/ A„] . 
While the processes B{r) are defined from the jumps AX^ = X^ — Xs~ of 
X, we do not observe these jumps directly. Rather, all that we observe are 
the discrete increments 

(13) = XiA„ - ^(i-i)A„ • 

From these increments, we could try to evaluate B{r)T and then infer (5. 
Finding consistent estimators for B{r)T is easy, but deducing from them 
an estimator for [3 is almost impossible, because we need to decide whether 
B{r)T is infinite or not based on a finite sample. 
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So, we propose the following idea. For fixed w > and a > 0, we write 

lt/A„] 

(14) U{w,a)'^= l{|A7X|>aA-} 

i=l 

for the number of increments whose magnitude is greater than aA^ . In all 
cases below, we will set zu < 1/2. 

To better understand our rationale for doing this, consider the special case 
X = aW + Y, where y is a /3-stable process, so Pti^^) = P- Any increment 
A^X = XiA„ - ^{i-i)A„ satisfies AfX = aAj/^Wi + Ai^'^Yi (equality in 
law). Then, recalling that /? < 2 and A„ 0, with a large probability 

1/2 

is close to a An W\ in law. Those increments give essentially no information 

1/2 

on Y and are of order of magnitude An ■ However if Y has a "big" jump 
at time s, the corresponding increment is close to Ay^- Hence, one has to 
throw away all the "small" increments. However, /? is related to the behavior 
of F near and, hence, to the "very small" jumps of Y . This is why we will 
use only increments bigger than a cutoff level aA^ for some w G (0, 1/2). 

1/2 

Asymptotically, those increments are big, because, ' <C A^, the 

main contribution is due to Y . Those increments mostly contain a single 
"big" jump of size of order at least AJf , and we still get some information 
on small jumps, because AJ^ 0. 

So, by using the statistic C/, which simply counts the number of large 
increments, defined as those greater than aAJ^, we are retaining only those 
increments of X that are not predominantly made of contributions from 

1/2 

its continuous semimartingale part, which are Op(An ), and instead are 
predominantly made of contributions due to a jump. The same heuristics 
work for more general Ito semimartingales. 

As we will see later, the key property of the functionals [/(tu, a, A„) is 
their convergence in probability 

(15) A-%(zz7,a)r^4' 

which we will show holds under Assumption 2. This property leads us to 
propose an estimator of /3 at each stage n. Fix < a < a' and define 

„ ,^ log(^(tI7,a)r/£/(t^,«0r) 

(16) Pn{t,w,a,a) = — , ^, , , 

log(a7a) 

which is at least consistent for estimating (3 on the set {At > 0}. If either 
value of U in (16)-(17) is 0, then, by convention, we set the estimator to be 
0. 

(3n is constructed from a suitably scaled ratio of two C/'s evaluated on 
the same time scale A„ at two different fixed levels of truncation of the 
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increments a and a'. In a way, this construction is in the same spirit as the 
classical estimator of Hill (1975), who conducts inference about the tails of 
a distribution based on ratios of various extremes. 
We can also propose a second estimator defined as 

(17) [3^{t,w,a) = 



tulog 2 

where U2{'^-, a)f is defined analogously to [/(tu, a)f in (14), except that sam- 
pling at A„ is replaced by sampling at 2A.„. That is, is constructed from 
a suitably scaled ratio of two C/'s evaluated at the same level of truncation 
a on two separate time scales A„ and 2A„. 

One could also look at a third estimator (3'^ obtained from two C/'s eval- 
uated at two different rates of truncation w and va' . One could further 
consider estimators based not just on counting the increments that exceed a 
certain cutoff but also on the magnitude of these increments, as in the case 
of power variations truncated to use only the large increments. 

In the rest of the paper, we will focus mainly on the properties of the 
estimator noting that a similar type of analysis yields the consistency 
and asymptotic distribution of the other two estimators. In general, the 
asymptotic variance of /3„ is smaller than that of (3'^ and 

Before studying the properties of the estimator /?„, let us make a few 
remarks. 



Remark 5. Asymptotically, as n — > 00, the above estimators behave 
well. However, for any given n, it may happen that they are not informa- 
tive, because too few increments are retained, up to the extreme case where 
U{w, a)" = U (ro, a')" = 0. If this is the case, one should take smaller values 
of a and a' . 



Remark 6. Even when the estimators are well defined, they may take 
a value bigger than or equal to 2. In this case, the estimation is not reliable, 
and it may be an indication that Assumption 2 is simply not satisfied, which 
would be the case for example if there is no jump at all in the observed path. 
So it would make sense to convince oneself that jumps are present [see, e.g., 
Ait-Sahalia and Jacod (2009)] before attempting to estimate (5. 

Remark 7. As we will see below, asymptotic considerations lead to the 
selection of = 1/5 as a universal choice valid for all possible values of (3. 
The cutoff for large increments is aA^. When implementing the estima- 
tor in practice, in any given sample the value of A„ is fixed, so w and a 
are not independent parameters. The level of truncation a may be set in 
relation to the volatility of the continuous part of the semimartingale [i.e., 
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/q fj^ (is)^/^] since the objective is to eliminate the increments that are 
mainly due to the continuous part. The truncation level can be selected in 
a data-driven manner. Despite the presence of jumps, that volatility can be 
estimated using the small increments of the process, since 

[t/An] ^ t 

(18) ^ |A"Xpl||A^X|<aA-} — cflds 

i=l •'^ 

for any a > and tjj G (0, 1/2). We can then set the cutoff level a to yield 
a number of (estimated) standard deviations of the continuous part of the 
semimartingale. For the estimator a' can then be set as a multiple of a. 
These data-driven choices determine a range of reasonable values for (a, a'). 
One possibility is then to simply average the estimators f3n obtained for the 
values of (a, a') over that range. The parameters (a, a') effectively play a 
role similar to that of bandwidth parameters in a nonpar ametric analysis. 

Remark 8. The construction of the estimators relies on the property 
(15), which holds under (slightly) weaker assumptions than Assumption 2, 
provided that the definition of At and the rate of convergence are suitably 
amended. As a result, the estimators given in (16) remain consistent under 
weaker assumptions. For example, when X has only finitely many jumps, 
the index is /? = 0, and U{w, a)" converges to the number of jumps between 
and t, irrespective of the value of a, so /3„ is equal to for all n large 
enough (obviously, this rules out the possibility of a central limit theorem). 

Remark 9. Our estimator is based on the count of "big" increments, 
although we are interested in the properties of the "small" jumps of X, 
which are those governing the index /3. This is because the behavior of sums 
of the squares of the small increments behave as described in (18), and other 
powers smaller than 2 are also driven by the "Wiener part" of X and do 
not provide insight on the small jumps. Perhaps considering sums of powers 
bigger than 2 for "small" increments would provide an alternative means of 
constructing estimators of /?, but we did not consider this possibility here. 

2.4. Properties of the estimators. Our first result states that our esti- 
mators estimate (3 on the (random) set {At > 0}, where the jump activity 
index is (3, and we can state the following rate of convergence. 

Theorem 1. Let < a < a' , < w < 1/2 and t>0. Under Assump- 

^ IP — 

Hons 1 and 2, we have /3(j^(t,tx7,a,a') — > P on the set {At >0}. Moreover^ 
if 
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(19) 

then the estimators f3n{t,'(jJ,a,a') are A^~^-rate consistent for any e > on 
the set {At > 0}, in the sense that the sequence of variables {-r^iPnit, «') ^ 
P))n>i is bounded in probability (or, "tight") in restriction to this set. 

The number x is positive, but it may also be very small. If we want 
an associated distributional result, we need stronger assumptions, which 
essentially implies that x = to/?/2 above, and this requires that the activity 
indices /3' and /3 — 7 of the "nonstable-like" part of the Levy measure be 
sufficiently apart from the leading activity index /3, as follows. 

Theorem 2. Let < a < a' and t > 0. Assume Assumptions 1 and 2 
with P' £ [0,p/{2 + P)) and 7 > P/2. Then, if w < 1/(2 + (5) A 2/(5/3), and 
in restriction to the set {At > 0}, we have the following stable convergence 
in law to a centered normal variable independent of X : 

(20) -4mCPn{t. ^' «' «') - ^ (O' TTT-T^) • 

i^n ' ^ At{\og[a' / a)Y J 

The qualifier "in restriction to the set {At > 0}" is essential in this state- 
ment. Recall that, unlike the usual convergence in law, stable convergence in 
law makes it possible to restrict the convergence to a subset of O exactly as 
convergence in probability does. On the complement set {At = 0}, anything 
can happen. On that set, the number P has no meaning as a jump activity 
index for X on [0,t]. 

Moreover, the stable convergence in law allows for the convergence of 
standardized statistics. 



Theorem 3. Under the assumptions of Theorem 2, the variables 

(2^) l^ mi > a1"^^ 1 A ^ ^' - ^) 

Vl/C/ (w, a', A„)t - \l\J{w, a, A^jt 

converge stably in law, in restriction to the set {At > 0}, to a standard 
normal variable Af {0,1) independent of X. 

These results are model-free in a sense, because the drift and the volatil- 
ity processes are totally unspecified apart from Assumption 1, and the Levy 
measures Ft are unspecified, other than the requirements specified in As- 
sumption 2. These three theorems will be proved in Section 8 below. 
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The restriction on w given in the statement of Theorem 2 restricts ad- 
missible values of Ci7 in a manner that depends on (5. Since /3 is unknown 
at this point, we must select a "universal" value of w that is admissible for 
all values of /3. Not surprisingly, the most stringent value of w is obtained 
in the limit where (3^2, yielding w = 1/5, and this is the value we suggest 
for empirical applications. 

We note (without proof) that a similar set of properties hold for the 
second estimator (5'^ based on the ratio of U' s estimated at two different 
frequencies A.„ and 2A„, with (21) replaced by the standardized statistic 

Vl/C/(w,a,2A„)j - l/C/(ro,a, A„)j 

3. Stable processes. Here, we specialize the general results in an impor- 
tant special case discussing, in particular, the efficiency of the estimators 
of /3 we propose. In the special case of stable processes, the model is fully 
specified parametrically, and we can compare the properties of efficient para- 
metric estimators of (3 to those of the general estimators [in- 

Denote, by Y, a symmetric stable process with index (3 G (0,1/2). We 
study the two situations where Xt = Yt (the simplest of all since there is no 
continuous part) and Xt = ht + aWt + Yt, where o" > 0, 6 S K and TV is a 
Brownian motion. The Levy measure depends on a scale parameter ^ > 
and the index (3. It has the form 

Af3 

Fidx) = — — dx, hence 

2\x\^+P 

(23) _ ^ 

F{x):=F{[-x,xf) = -^ forx>0. 



X' 



The law of Y\ has an even density g and a tail function G{x) =P(|yi| > x) 
satisfying, as x — > oo [see Zolotarev (1986), Theorems 2.4.2 and Corollary 2 
of Theorem 2.5.1], 

(24) a{x) = ^ + o[-^), G(x) = A + o(^_L 

In both cases X = Y and X^ = ht + aWf + It, we obviously have As- 
sumptions 1 and 2, with Ft = Fj. = F not depending on {uj,t), and with 
F/' = F{" = 0, H = 17 X (0, cx)), /?' = 0, and ft{x) = and, finally, At{uj) = A, 
which is the constant in (23). Then, we can apply the previous results, which 
further hold on the whole set [because here At = tA > for all {t, uj)]. The 
results are much easier to prove in this special case, and also the requirements 
on w are significantly weaker, thus allowing for faster rates of convergence 
(the larger vj, the faster the convergence in Theorem 1). But these improved 
results are no longer model-free, since the structure of jumps is completely 
specified in this stable model up to the unknown parameters A and /?. 
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3.1. The case X = Y . Consider first the case where X has no continuous 
part. Then, the general results on /?„ can be improved to yield the following. 

Theorem 4. Assume that X = Y. Let <a<a' and w>0 and t>0. 
Then: 

(a) If w < 1/(3, the estimators ro, a, a') converge in probability to 



(b) If further w < 2/(3/3), we have stable convergence in law, over the 
whole set Q,, as described in Theorems 1 (with At = tA) and 3. 

Note that in part (a) of the theorem, the closer /3 is to 2, the stronger the 
constraint on the truncation rate w. 

These estimators are not, however, rate-efficient. To see this, one can recall 
from Ai't-Sahalia and Jacod (2008) that the parametric model in which one 
observes the values for zA„ <t\s regular, and its Fisher information 

for estimating (3 is asymptotically of the form 



for some constant Cp. We can thus hope for estimators that, after centering 



by j3 and normalization by ^log(l/ A„,) /V A„ are AA(0, 1/Cp), and, in fact, 

the MLE does this. 

Where is the loss of efficiency coming from? In order to compute our 
general estimators j3n, we are forced by the presence of a continuous part 
in X to discard a very sizeable portion of the data, which is the effect of 
truncating away the small increments of X. However, in this case, if somehow 
we knew from the start that there is no continuous part in X, then there 
would no longer be a need to do that. It is clear that better estimators of [3 
could then be constructed. 

And if, further, the law of Y has a fully-specified parametric form, as is 
the case here, then it would be possible to improve the estimators even more. 
In this example, we would simultaneously estimate (3 and A, but the rates 
would be unchanged and it would be even more model-dependent. So this 
is the kind of estimator that we do not want to use, since we have no hope 
of extending such an estimator to the general semimartingale situation (or, 
in fact, even to more general Levy processes than the stable ones). 

3.2. The case Xt = bt + aWt + Yt. We now study the situation where 
y is a stable process, but X now also contains a continuous part. The 
distributional properties of the estimators follow directly in this special case. 
Indeed, for this model, U{w,a)'i is essentially the same as, or close to, the 
number V{w, a)" of jumps of Y that are bigger than aA^ in the interval 



13; 



(25) 



log(l/A„) 



Cpt 
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[0, t]. But y(-ci7, a)" is a Poisson random variable with parameter Ct/a^ 
where C is a constant. Hence, 

(26) AP^V{w,a)^^C/a^, 

(27) ^^(A^-y(tx., a)r - C/a^) ^ M{0, C/a^). 

These properties carry over to U{w,a)2, and this leads to the following 
improvement to Theorem 1. 

Theorem 5. Assume that Xt = bt + aWt + Yf. Let <a <a' and w >0 
and t>0. Then: 

(a) If Tu < 1/2, the estimators f3n{t,vj,a^a') converge in probability to 

P; 

(b) If zu < l/{2 + P), we have the stable convergences in law, over the 
whole set ft, as described in Theorems 1 (with At = tA) and 3. 

The estimators (3n are again not rate-efficient, although they do come 
close. In fact, using the methods of Ai't-Sahalia and Jacod (2008), we can 
show that Fisher's information for estimating (3 at stage n satisfies 



(28) 



A{\og{l/A^)f~PI^ 



for another constant C'^. Furthermore, in the (partial) statistical model 
where we observe the increments provided, they are bigger than aA^ and 
discard all others (here a > and < < 1/2), Fisher's information now 
satisfies 

So, our general estimators are almost [up to a log(l/A„) factor] rate-efficient 
for the partial parametric statistical model. As for the "complete" model, 
the rate approaches the true rate by taking w close to 1/2, but we cannot 
take w bigger than l/{2 + (3), and since, in practice, [3 is unknown other 
than being less than 2, a universal choice may be tu = 1/4, which is less 
stringent than the choice = 1/5 required in the general case. 

4. General Levy processes. Let us now consider the case where X is 
a general Levy process. Its characteristics are of the form (2) with bt = b, 
at = CT and Ft = F deterministic and not depending on t. Then, Assumption 
1 holds. As to Assumption 2, it may or may not hold, but if it does it takes 
a slightly simpler form because then everything is independent of (w,t). In 
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particular, At = At for some constant ^ > 0. The two Theorems 1 and 3 
hold without modification, except that either {A^ > 0} = Q for all t > 0, or 
{At > 0} = for all t, in which case those theorems are void of content. 

What is important here, though, is that those results fail when the as- 
sumptions we made are not satisfied, even with such a simple probabilistic 
structure for X. 

In order to see why Assumption 2 is needed, let us consider a simpler but 
closely related statistical model. More precisely, suppose that we observe all 
"big" jumps of X up to time t; that is, AXs with \ AXs\ > aAj^ for all s <t. 
A priori, this should give us more information on the Levy measure than the 
original observation scheme where only increments (as opposed to jumps) 
are observed and only those bigger than aA^ are taken into consideration. 

In this statistical setting, the estimators (16) have no meaning, but we 
can replace f3n with 

(30) /?Jt,n7,a,a ) = ^-^^ , 

where we have set 

(31) U{zu,a)t =J2l{\AXs\>aA^}- 

s<t 

The estimators /3„ are of course only virtual since there is no hope of 
actually observing the exact jumps of the process. But, in the rest of this 
section, we study the behavior of the estimators (3^ in order to gain some 
insight on the necessity of making a restrictive assumption on the Levy 
measure Ft if one is to estimate p. We will see that such an assumption is 
needed even under these idealized circumstances. Set 

(32) 7„(w,a)=F(aA-). 



Lemma 1. Let 
(33) M(ro,a)r 

Then: 

(a) Each sequence of processes M{w, a)" converges stably in law to a 
standard Wiener process, independent of X ; 

(b) If a < a', all limit points of the sequence 7„(ro, a')/7„('a7, a) are in 
[0,1]. Further, if this sequence converges to 7, then the pair {M{w,a)^, 
M{w,a')^) of processes converges stably in law to a process {W,W ), which 
is independent of X and a 2 -dimensional Wiener process with unit variances 
1 and unit covariance -y/7. 



■.{U{w,a)t -7„(zi7,a)f). 
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Proof. The processes = M{w, a)" and M'" = M{w, a')" are Levy 
processes and martingales, with jumps going uniformly to 0, and with pre- 
dictable brackets 

V7ri(TO,a) 

Observe, also, that a'A^ > aA^; hence, 7n(ti7,a') < 7„(-a7,a). The remain- 
ing results then follow [see Jacod and Shiryaev (2003), Chapter VII]. □ 

Theorem 6. If a' > a and ^ 7 G [0, 1], then the sequence 

/Q/1^ r~f n/^fl ^ '\ log(7n(CT,a)/7n(ro,aO) ^^ 
(34) ^-fn{w,a')i^pjt,w,a,a) iog(a'/a) ) 

converges stably in law to an M{0, f(iog(Q//Q))ij ) variable, independent of X . 

This result is a simple consequence of the previous lemma, and its proof is 
the same as the proof of Theorem 1 once the CLT for the processes U{w, a)" 
is established, which we will do later. 

So, the situation seems generally hopeless. These estimators are not even 
consistent for estimating the activity index (3 F because of bias, and to 
remove the bias we have to know the ratio 7n,(^, ct')/7n(^5 ^) (or at least its 
asymptotic behavior in a precise way), and, further, there is no CLT if this 
ratio does not converge (a fact which we do not know a priori, of course). 

The major difficulty comes from the possible erratic behavior of F near 
0. Indeed, we have (6) with (3 instead of (31, but there are Levy measures 
F satisfying this, and such that for any r S (0,/?) we have xl^F{xn) — > for 
a sequence rE„ — > (depending on r, of course). If F is such, the sequence 
7n(CT, Q!')/7n(ro, a) may have the whole of [0,1] as limit points, depending 
on the parameter values ■ci7,Q!,a', and in a completely uncontrolled way for 
the observer. 

So, we need some additional assumption on F. Let us consider two as- 
sumptions (the second one is stronger than the first one). 

Assumption 3. F is regularly varying at 0, with index (3 £ (0,2). 

We have 

A>0. 

Theorem 7. (a) Under Assumption 3, we have (3^{t,zu,a,a') — > 



Assumption 4. 
(35) 

as X ^ 0, for some 
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(b) Under Assumption 4, the variables A„™^(/J„(t, ro, a, a') — f3) con- 



Proof. Assumption 3 implies that 7n(ro, a) ^ oo and 7„(ti7, a)/7n(ro, a') 
{a' /a)^ , so the previous theorem yields (a). Assumption 4 clearly implies 



and also 7n(ro, a) ~ A/a^ A'^^ , so (b) follows again from the previous theo- 
rem. □ 

It may of course happen that Assumption 3 or 4 fail and nevertheless 
the conclusions of the previous theorem hold for a particular choice of the 
parameters {w,a,a') or for a particular choice of the sequence A.„. But, in 
view of Theorem 6 and of the previous proof, these assumptions are necessary 
if we want those conclusions to hold for all choices of {vj,a,a'). 

Now, coming back to the original realistic problem, for which only in- 
crements of X are observed. Assumption 2, when Ft{uj,dx) = F{dx) for all 
{uj,t), is obviously stronger than Assumption 4, but not much more. The 
need of stronger assumptions for the original problem comes from the fact 
that although when we observe a "large" increment A^X it is with a high 
probability almost equal to a "large" jump. Nevertheless, the observation of 
this jump is blurred by the Brownian component and also by a sum of very 
small jumps. This fact is also the reason why we need some restriction on 
vj for the original problem, whereas, here, can be arbitrarily large. 

5. Small sample bias correction. By construction, we are forced by the 
presence of a continuous semimartingale to rely on a small fraction of the 
sample (i.e., those increments larger than aAj^) for the purpose of estimating 
p. As a result, the effective sample size utilized by the estimator f3n is small, 
even if we sample at a relatively high frequency. This situation calls for an 
analysis of the small sample behavior of the estimator. 

Such a small sample analysis is out of reach in general but it can be carried 
out explicitly for the model Xt = aWt + OYt studied in Section 3, where Y is 
a symmetric /3-stable process and is a Wiener process. Let g denote the 
density of Yi. Here, the process Y is standardized by E(e*"^*) = e"*'"'''/^, so 
the limit (3^2 corresponds to the standard normal density (f). 

One additional step in the expansion (24) yields, as x — > -|-oo. 




variable, independent of X. 




(36) 
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and, for the tail of the distribution, 

(37) G{x) =Fm\>x) = 2£^g{v)dv = ^ + 0^ + o{^^), 
where the coefficients of the expansion are 

(38) C/B = — sm — and df^ = sin(7r/3) . 



2tt \ 2 J ^ Svr 

This parametrization corresponds in terms of the general notation of the 
paper to 

(39) At = A = 29^0^ /p. 

Now, consider the tail probability Pn at the cutoff level aA^. The prob- 
ability Pn determines the limiting behavior of the Us, since U{vj, a)" ~ 
{t/An)Pn- We have 

+ 00 

g{v) dv (p{u) du 



So, with 



/;;G(^Ar"^(i->y-".))««)<<«. 



(40) 



a 



and u(f>{u) du = and u'^(f){u) du = l, we see, from (37), that 
(41) 

+ smaller terms. 

The behavior of P„ suggested by the leading term in the expression 
(2c^0^/(/3a^))A^~^^ is the one we have used to define the estimator /3n in 
(16) by exploiting the dependence of that leading term on a. The first correc- 
tion term (/3(/3-|- l)c7^/ (2a^))A^~^^ in (41) is due to the interaction between 
the Wiener and the stable processes, while the second {df^O^ /{2cfsa^))Al^~'^^ 
is due to the more accurate approximation of the tail of the stable process 
in (36) compared to the leading order term in (24). 
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To understand intuitively the need for the first term, suppose that the 
cutoff level corresponds to seven standard deviations of the continuous part 
of the semimartingale. There is very little probability that the Wiener pro- 
cess alone will generate an increment that large. On the other hand, when 
we count the increments due to the jump process alone, we are missing in- 
crements of the sum of the continuous and discontinuous parts where, say, 
the Wiener process is responsible for a one standard deviation move, and 
the jump process for a six standard deviation move, of the same sign. We 
are also missing increments where the jump process gives an eight standard 
deviation move and the Wiener process a one standard deviation move, of 
the opposite sign. The two effects partly compensate each other and indeed 
the term in u in (40) leads to an integral whose value is zero. But the next 
effect, in , leads to a net increase in the total number of increments that 
are larger than the cutoff when the interaction between the Wiener and 
jump processes is accounted for. 

Asymptotically, the first of the two correcting terms in (41) is the largest, 
since 1 ^ A^~^^ ^ A^~^^, but in small samples a large value of the scaling 
parameter 9 relative to a can make their magnitudes comparable. Using (41) 
at two different values a and a', we obtain 



This suggests a small sample bias correction for the estimator obtained 
by subtracting an estimator of the two correction terms on the right hand 
side of (42) from (3ri- As we will see in simulations below, the two correction 
terms are quite effective in practice. 

Further, we note that the two correction terms in (42), of respective orders 
A^~^^ and A,;^~^^, are asymptotically negligible at the rate An^^''^ at 
which the central limit occurs. This is due to the restrictions on the choice 
of w imposed by Theorem 2. Consequently, the bias-corrected estimator has 
the same asymptotic distribution as the original estimator. 

More generally, we have 



(42) 





(43) 




+ 
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To implement the bias correction in practice, we need to estimate the terms 
{1/ At) Jq Agag ds and (l/At) /q A"^ ds. In the case of a stable symmetric pro- 
cess, As = A and so (l/At) /q ^4^ = j4 = 26^cp/p. We can then replace 

(l/At) jQAgCTgds by {l/t) j^alds and use any standard estimator of the 
integrated volatility. In general, we have 

(44) _ 

(45) A-%(zu,a)r~ao^+ai^ + a2^, 

where A = /q Aga^ ds, Aa"^ = Jq Aga^ ds and A'^ = /q ds. We can estimate 
the unknown coefficients ao, oi and 02 in expression (45) by a straightfor- 
ward linear regression of AJ^'^C/(to, a)" on and For the 
purpose of running that regression, we use different cutoff levels a and com- 
pute the corresponding number of increments exceeding that level, U {zu, a)f 
and the first-stage estimate of (3. Given estimates of the regression coeffi- 
cients, we have a generalized bias correction procedure based on subtracting, 
from /3n, the terms on the right-hand side of 

(46) /5n-/3~T7r7r77rT - -0--70 + " 



log{a' /a) \ao\a^ a!'^ ) aQ\a^ a!^ 
evaluated at the regression estimates of oq, a\ and 02- 

6. Monte Carlo simulations. We now report simulation results docu- 
menting the finite sample performance of the estimator /3„ in finite sam- 
ples. We calibrate the values to be realistic for a very liquid stock. We use 
an observation length of T = 1 day, consisting of 6.5 hours of trading (i.e., 
n = 23,400 seconds). 

The averages and standard deviations of the estimator which is based 

on two different levels of truncation a and a', are reported in Table 1 for 

various values of /3 up to 1.5 and include a continuous (Brownian) part. The 

table reports the results of 5000 simulations. The data generating process is 

1/2 

the stochastic volatility model dXt = atdWt + dYt, with at = , dvt = 

K{r]-vt)dt + jvy^dBt + dJt, E[dWtdBt] = pdt, 77^2 = o.25, 7 = 0.5, k = 
5, p = —0.5. J is a compound Poisson jump process with jumps that are 
uniformly distributed on [—30%, 30%] and Xq = 1. The jump process Y is 
either a /3-stable process with 0= 1.5, 1.25, 1.0, 0.75, 0.5 and 0.25, or a 
compound Poisson process (which has finite activity and is marked /3 = 
in the table) with fixed jump size 0.10. The estimator is implemented with 
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Table 1 

Monte Carlo simulations of the estimator Pn based on two levels of truncation for 
P-stable processes and a compound Poisson process (l3 = 0) 



Sampling A„ 




1 sec 


1 sec 


1 sec 


1 sec 


5 sec 


tail probability 




0.25% 


0.5% 


1.0% 


2.5% 


1.0% 


/3 = 1.5 


sample mean 


1.52 


1.51 


1.50 


1.52 


1.53 




Sample stdev 


[U.ZO ) 


(^U.loj 


(0.13) 


[U.Uo ) 


[U.ZO J 




Asymp stdev 


(0.26) 


(0.18) 


(0 13) 


(0.08) 


(0 24) 


f3 = 1.25 


Sample mean 


1.27 


1.26 


1.25 


1.26 


1.27 




Sample stdev 


(0.23) 


(0.16) 


(0.11) 


(0.07) 


(0.19) 




Asymp stdev 


(0.23) 


(0.16) 


(0.11) 


(0.07) 


(0.19) 


13=1.0 


Sample mean 


1.01 


1.01 


1.00 


1.00 


1.01 




Sample stdev 


(0.19) 


(0.14) 


(0.10) 


(0.06) 


(0.14) 




Asymp stdev 


(0.19) 


(0.14) 


(0.10) 


(0.06) 


(0.14) 


13 = 0.75 


Sample mean 


0.76 


0.76 


0.75 


0.75 


0.76 




Sample stdev 


(0.16) 


(0.11) 


(0.08) 


(0.05) 


(0.11) 




Asymp stdev 


(0.16) 


(0.11) 


(0.08) 


(0.05) 


(0.11) 


13 = 0.5 


Sample mean 


0.51 


0.50 


0.50 


0.50 


0.50 




Sample stdev 


(0.13) 


(0.09) 


(0.06) 


(0.04) 


(0.08) 




Asymp stdev 


(0.13) 


(0.09) 


(0.06) 


(0.04) 


(0.08) 


13 = 0.25 


Sample mean 


0.25 


0.25 


0.25 


0.25 


0.25 




Sample stdev 


(0.09) 


(0.06) 


(0.04) 


(0.04) 


(0.05) 




Asymp stdev 


(0.09) 


(0.06) 


(0.04) 


(0.04) 


(0.05) 


13 = 


Sample mean 


0.01 


0.01 


0.01 


0.01 


0.02 




Sample stdev 


(0.02) 


(0.01) 


(0.007) 


(0.005) 


(0.01) 



a = 5r/, a' = Wr] and w = 0.20. Given rj and a, the scale parameter 9 (or 
equivalently A) of the stable process in simulations is calibrated to deliver 
the various values of the tail probability P(|Alt| > aA^) reported in the 
columns of the table; for the Poisson process, it is the value of the arrival rate 
parameter A that is set to generate the desired level of jump tail probability. 

In each row, the top number is the average value of the estimator /?„ across 
the simulations, after inclusion of the bias correction discussed in Section 
5, while the number below, in parentheses, is the standard deviation of the 
estimator across the same simulations. The third number in parentheses is 
the estimated asymptotic standard error based on the limiting distribution 
given in the sections above. A higher tail probability in the columns has the 
effect of generating more increments from the jump process that exceed the 
cutoff level, which makes more observations available and correspondingly 
reduces the standard deviation of the estimates. 

As the results show, (5n picks up on average fairly accurately the true value 
of p. As P gets too close to 2, the /3-stable jump process starts to approximate 
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Estimator Based on Two Truncation Levels 




C 0.2i 0,5 0,75 I 1,25 1,6 1,75 2 0,25 0,5 0,75 1 1.25 1,5 1,75 J 



Fig. 1. Monte Carlo distributions of the estimator I3n based on two levels of truncation 
for 13-stable processes (0 < P <2). 

too closely the behavior of the Brownian motion and the performance of the 
estimator deteriorates. 

Further simulations (not reported to save space) suggest that the esti- 
mator is not overly sensitive to the selection of the truncation levels {a, a') 
within a reasonable range. Histograms of the distribution of the estimator 
Pn sjce shown in Figure 1 for the same values of /?; the figures are based 
on the 1% level of jump tail probability. The histogram reports the raw, 
unstandardized, values of which are not expected to be asymptotically 
normal unlike the standardized versions. 
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Estimator Based on Two Truncation Levels 



2000 I- 



1500 ■ 



1000 ■ 



SOO - 



. m 1 J L J L L I J 

0.25 0.5 0.75 1 1.25 1.5 1.75 2 

Fig. 2. Monte Carlo distribution of the estimator /3„ based on two levels of truncation 
for a compound Poisson process (j3 = Q). 



A special mention should be made about the compound Poisson case, 
which corresponds to /? = and does not satisfy Assumption 2. As discussed 
in Remark 8, there is no central limit theorem in this case, and /3„ should 
be equal to for n large enough. Intuitively, when there is a small number 
or large jumps, the same large increments remain in the sample at the two 
truncation levels a and a' , and the ratio of U evaluated at a to U evaluated 
at a' in (16) is equal to 1. This is what happens in simulations in the vast 
majority of cases, as shown by the histogram for (3 = reported in Figure 2. 

The asymptotic distribution is an accurate guide for the small samples as 
shown in the standardized distributions in Figure 3. The estimator in the 
histograms is standardized according to the asymptotic distribution given in 
Theorem 3, and the solid curve in the figures is the limiting AA(0, 1) density. 
As the figures show, the asymptotic distribution is a fairly accurate guide 
for the small samples. This is in spite of the relatively small number of 
(large) increments that are effectively used by the estimator, combined with 
the facts that some large increments are kept, even though they may not 
have contained a large jump, or, conversely, smaller increments may have 
contained two or more large, cancelling, jumps, or the Wiener process may 
have combined with the pure jump process to produce a larger increment. 
Asymptotically, these effects do not show up at the leading order in A„ but 
are present in small samples and appear to be effectively captured by the 
bias correcting term. 

Finally, we compare in simulations the performance of the two estimators 
Pn (based on two truncation levels) and (3'^ (based on two sampling frequen- 
cies) with the same experiment design as above. The sampling frequency is 
An = 1 second, the length of observation T = 1 day or 23,400 seconds. The 
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Estimator Based on Two Truncation Levels 
P = 1,.'i |i=L25 




P = l 




p = 0.5 





15 = 0.75 




(5 = 0.25 




Fig. 3. Standardized Monte Carlo and asymptotic distributions of the estimator j3n based 
on two levels of truncation for p-stable processes (Q < (3 <2). 



tail probability is 1.0%, the middle value in Table 1. The results are shown 
in Table 2. The estimator tends to have a larger standard deviation than 
f3n and is slightly biased. For these reasons, we have focused on the estimator 
[in and emphasize its use in the empirical application that follows. 

7. Empirical application. We now implement the estimator /?„ for the 
two most actively traded stocks in the Dow Jones Industrial Average index, 
Intel (INTC) and Microsoft (MSFT), and each trading day in 2006. The 
data source is the TAQ database. Each day, we collect all transactions on 
the NYSE or NASDAQ, from 9:30 am until 4:00 pm, for each one of these 
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Table 2 

Comparison of the two estimators of j3 in Monte Carlo simulations for P-stahle processes 
and a compound Poisson process (P — 0) 











Two truncation levels 

/3„ 


Two sampling frequencies 




1.0 


Sample 


mean 


1.00 


1.00 






Sample 


stdev 


(0.10) 


(0.13) 


p= 


0.5 


Sample 


mean 


0.50 


0.51 






Sample 


stdev 


(0.06) 


(0.09) 


p= 





Sample 


mean 


0.01 


0.03 






Sample 


stdev 


(0.007) 


(0.02) 



stocks. We sample in calendar time every 5 and 15 seconds. We use filters to 
eliminate clear data errors (price set to zero, etc.) and all transactions in the 
original record that are later corrected, cancelled or otherwise invalidated, 
as is standard in the empirical high frequency literature. 

The two time series are plotted in Figure 4. Figure 5 contains a histogram 
of the tails of the unconditional densities of the log-returns from the two 
stocks. Comparing the two figures, we see that it is quite possible to have 
a standard time series plot display little evidence of large moves (Figure 
5), while the tails of the distribution look substantially fatter than normal 
(Figure 5) as confirmed by the descriptive statistics in Table 3 for the two 
log-returns series. All together, this evidence points in the direction of many 
small, active jumps of the type that we seek to uncover using our estimator 

of ;3. 

More formally, we compute the statistic Sn of Ai't-Sahalia and Jacod 
(2009) to test for the presence of jumps in the data. Over the different 
sampling frequencies considered (ranging from 2 seconds to 1 minute), the 
largest value of the statistic Sn we obtain for the different quarters and the 
two stocks is 1.19. Since the asymptotic value of Sn is 1 (resp. 2) when jumps 




Jan Apr Jul Oct Dec Jan Apr Jul Oct Dec 



Fig. 4. Time series of INTC and MSFT stock prices, all trading days in 2006. 
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Tails of INTC Log-Retums Density 
11 




Tails of MSFT Log-Returns Density 
r 
t 
I 
I 
I 




■A 



Fig. 5. Tails of the marginal density of INTC and MSFT log-returns, all trading days 
in 2006 sampled at 5 second intervals. The dashed curve is the leading part of the Levy 
measure, that is F-[{dx) given in (8), with jump activity index (3, estimated for each stock 
by (3n. 

are present (resp. absent), this provides further evidence for the presence of 
jumps. 

Table 4 reports the estimates produced by /3„ for the two stocks, for 
the fuh year. For each stock and sampUng intervals A„ of 5 and 15 seconds, 
respectively, the left columns in the table use a level of truncation a, which is 
set to 7 estimated standard deviations of the continuous part of the process, 
a second level a' = 2a and a truncation rate w = 0.20. In light of the different 
possible choices of a and a' , a simple variance-reducing procedure consists 
in averaging the estimators obtained over the different {a, a'). The columns 
marked AVG report the results of averaging the estimates over values of a 
ranging from 7 to 9 estimated standard deviations of the continuous part of 
the process and over values of a' ranging from 1.5 to 4 times a, as discussed 



Table 3 

Descriptive statistics for Intel and Microsoft transactions in 2006 





Mean 


Stdev 


Skew 


Kurt 


Min 


Max 








INTC 








Qtr 1 


-2 X 10"*^ 


0.00069 


-33 


5195 


-0.109 


0.019 


Qtr 2 


-3 X 10-'' 


0.00070 


-0.1 


136 


-0.027 


0.027 


Qtr 3 


6 X 10"'^ 


0.00071 


-0.4 


93 


-0.027 


0.022 


Qtr 4 


-1 X 10"^ 


0.00065 


-0.5 


110 


-0.021 


0.030 


All year 


-5 X 10"^ 


0.00069 


-8.5 


1430 


-0.109 


0.030 








MSFT 








Qtr 1 


3 X 10"^ 


0.00050 


0.1 


190 


-0.025 


0.028 


Qtr 2 


-1 X 10"® 


0.00065 


-49 


8927 


-0.119 


0.021 


Qtr 3 


1 X 10~® 


0.00053 


7.3 


749 


-0.017 


0.051 


Qtr 4 


7 X 10"'^ 


0.00051 


-0.2 


388 


-0.031 


0.031 


All year 


2 X 10"'' 


0.00055 


-19 


4715 


-0.119 


0.051 
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Table 4 

Estimates of (3 from all Intel and Microsoft transactions in 2006 



INTC MSFT 





5 


sec 


15 


sec 


5 


sec 


15 


sec 


OL 


7 


AVG 


7 


AVG 


7 


AVG 


7 


AVG 


OL 1 a. 


2 


AVG 


2 


AVG 


2 


AVG 


2 


AVG 




1.43 


1.56 


1.76 


1.72 


1.69 


1.62 


1.60 


1.61 






1.52 




1.69 




1.60 




1.59 




(0.04) 


(0.003) 


(0.05) 


(0.006) 


(0.05) 


(0.004) 


(0.05) 


(0.005) 



above in Remark 7. /3„ is the estimator defined in (16). The estimator /?„ in 
the table denotes the estimator obtained by applying to the regression- 
based bias-correction procedure described in Section 5 implemented using 
the same range of truncation levels as for the average AVG. Standard errors 
of the estimators are in parentheses. 

We find evidence of infinitely active jumps. The estimated values of /? are 
about 1.5 for INTC and 1.6 for MSFT. These correspond also to the average 




1.5 1.75 2 2. 25 2.S 2,75 

Truncation ^tio a '/a 



1-5 1.75 2. 2.25 2,5 2.75 J 
Truncation Ratio ci'/a 



MSFT, a = 10 seconds 




2 2.25 2.5 2.75 

Truncation Ratio a'/a 



2 2.25 2.5 2.75 J 

Truncation Ratio cr'/a 



Fig. 6. Estimates of [3 obtained from fin at 5 and 15 seconds for Intel and Microsoft, 
2006 transactions. 
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Degree of Jump Activity, 2006 
Time Series of Bi-Weekly Estimates 



IWTC, = S seconds 



HSPT, A = 5 seconds 




Apr Jul 
Month 



Oct Dec 



Fig. 7. Time series of the estimated j3 for Intel and Microsoft, all 2006 transactions, 
computed at the 5 second frequency on a bi-weekly basis. 



values Pn that we obtain if we average the four quarterly estimates, and also 
average them over different values of a and a' /a. Clearly, the nature of these 
infinitely active jumps cannot be assessed by mere visual inspection of the 
time series in Figure 4, as would have been the case if only large, infrequent 
jumps, were present. 

Figure 6 shows the values of the estimates computed over the full year for 
various values of a (the four curves on each plot correspond, resp., to 6, 7, 8 
and 9 estimated standard deviations of the continuous part of the process) 
and values of a' ranging from 1.5 to 3 times a (the horizontal axis represents 
the ratio a' /a). The continuous standard deviations are computed from the 
small increments of the process and updated each quarter. The figure shows 
that, for each stock, we obtain comparable values for (3 when varying a and 
a' . The top panel of the plot shows the estimates at A„ = 5 seconds, the 
bottom one at A„ = 15 seconds. As expected, the lower frequency estimates 
are more variable, since they rely on a smaller number of large increments 
in total, and the variance of Pn is proportional to the difference between one 
over the number of increments retained at the two cutoff levels. 

Figure 7 plots the time series of the estimated f3 for Intel and Microsoft 
computed using a time interval T of two weeks during year 2006 at the 5 
second frequency. The estimates are the AVG estimates (averages over the 
values obtained for a range of a and a') discussed in Table 4. The figure 
shows that the bi-weekly estimates, while variable over the course of the 
year, tend to be clustered around the same value of 1.5 as the full-year 
estimates. Shorter lengths of observation than two weeks tend to produce 
substantially more variable estimates, due to the small number of large in- 
crements available. 

Allow us one last comment regarding the high sampling frequency and 
its interaction with market microstructure noise in the data. There is no 
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doubt that market microstructure noise is a concern at sampling intervals of 
a few seconds. However, in the present context, we are only using the large 
increments of the process, namely those greater than aA^. Such increments 
are more likely to represent a true price movement rather than noise, espe- 
cially after the standard data cleaning consisting of removing price errors 
have been applied, whereas the noise most likely mainly corrupts the obser- 
vations by a relatively small amount. Similarly, the cutoff points are beyond 
the level of bid/ask bounces, or the tick size of 1 cent. 

Furthermore, all we are doing is counting those increments, not exploiting 
their magnitude. So, provided that the noise does not substantially affect 
the likelihood that an increment is above or below the cutoff, these estimates 
are unlikely to be seriously affected by the noise. But this paper represents 
only a first attempt at measuring the degree of jump activity. We leave to 
future work the development of estimators that are fully robust to market 
microstructure noise. 

8. Technical results and proofs. Our basic semimartingale X satisfies 
(2) and Assumptions 1 and 2. Note that all three Theorems 1, 2 and 3 
are "local" in time, so by a standard localization procedure [see, e.g., Ai't- 
Sahalia and Jacod (2009)] we can indeed assume a strengthened version of 
our assumptions, which follow. 

Assumption 5. The processes h and a are bounded by some constant 

L. 

Assumption 6. We have Assumption 2 with Lt{uj) = L being constant. 

In all the sequels, the letter K denotes a constant that changes from 
line to line and may depend on X and its characteristics and also on the 
parameters vu, a, a' , and we write Kp if we want to emphasize its dependency 
on some other parameter p. We use the shorthand notation and Pf_i, 
respectively, Et and Pt or the conditional expectation and probability with 
respect to .^(i-i)A„) respectively J^f 

Before proceeding, we mention a number of elementary consequences of 
Assumption 6, to be used many times. For all u £ (0, 1] and v E (0, 2] and 
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x,y G (0, 1], we have [recall the notation (4)] 



Ftix) < 4> 



FAx) - h 



K K 



[ x^Ft{dx)<Ku^-^, 

J{\x\<u} 



(47) 



' {\x\'' Al)Ft{dx)<l K„log{l/u), iiv = p, 

Ft{x) - + y) < ^ f 1 A ^ + x^^(^-^')) . 
L xP \ X / 



8.1. Estimates for stable processes. In this subsection, we consider a 
symmetric stable process Y with Levy measure (23). For each 5 G (0,1], 
we set 



(48) 



>&)}■ 



s<t 



Lemma 2. There is a constant K, depending on {A, 13), such that, for 
all s > 0, 



(49) 



\Y{8)',\>5/2)<Ks^l''/5^f^/\ 



Proof. We use the notation (23) and (24). We set 9 = sF{6/2) = sA{2/S)^ 
and consider the processes Y' = Y{6)' , Y" = Y{5/2)' and Zt = J2r<t ^{\AYr\>5/2}- 
Introduce, also, the sets 

D = {\Y,\>6/2}, D' = {\y:\>5/2}, 

B = {Zs = l}, B' = {Zs=0}. 

It is of course enough to prove the result for s/6^ small, so, below, we assume 
6'<l/2. _ 

By scaling, F{D) = G{6s^^/f^ /2), so (24) yields 

(50) \w{D)-e\<Ke'^. 

Next, Zg is a Poison variable with parameter 6 < 1/2. Hence, 

(51) \¥{B)-e\<Ke'^. 

Since Y" is a purely discontinuous Levy process without drift and whose 
Levy measure is the restriction of F to [—(5', 5'], we have 



(52) 



E((y;')^) = s I x^F{dx) < K96\ 

J{\x\<5/2} 
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The two processes Y" and Z are independent, and conditionally on B the 
law of the variable Zg is F restricted to > 5/2} and normalized by 6. 
Hence, 

P(B n D^) = e-^s f F(dx)¥(\Y'' + x| < - 

J{\x\>^} \ 2 

< e-^s [ F{dx)¥(\Y''\ > |x| - - 

J{\x\>5/2} V 2 

(53) <,(^fQ^<|^|<^(1 + 01/3) 

+^({N>^})F(ln1>V/3 

<K9(9'/^ + ^EiiY:r))<Ke^/^ 

where we have used (23) and (52) for the last two inequalities. 
Now, we have 

F{D n S^) = F{D) - F{B) + F{B n D""). 
Observe, also, that DOB' = D'OB', and D' and B' are independent. Hence, 



because F{B') = e ^ > e ^^"^ . The last two displays, plus (50) and (51) and 

(53) give us F{D') < K9^/'^. Hence, (49). □ 

8.2. Estimates for semimartingales. Below, we assume Assumptions 5 
and 6 without special mention. The semimartingale X can be written as 

X = Xo + B + X'' + (a;l{|^|<i}) {fi - u) + (j;l||^|>i}) * fi, 

where X^ is the continuous martingale part and /x is the jump measure, B 
and v are as in (2), and the stands for the stochastic integral with respect 
to random measures [see Jacod and Shiryaev (2003)]. For any 5 £ (0, 1] we 
set 

(54) <^ s<t 

[ Xi6y = X- XiSy = Xo + B + X^ + (xl{|,|<5|) Bi6), 

where 

(55) B{6)t= f\{6)sds, b{6)s= f xF^idx). 

Jo •'{S<\x\<l} 
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By Assumption 6, F[ and F[' are mutually singular. Hence, there exists 
a predictable subset <I> of x (0, oo) x M such that 

F/(u;, •) is supported by the set {x : (w, t, x) ^ $}, 
Fl'{u>, •) is supported by the set {x: {u>,t,x) G 

Observe, also, that if /?' < 1, we can set 

ft 



(56) 



(57) 



B{S)[= / b{5lds, 6(5); 



{<5<|x|<l} 



xF!^'{dx) 



and |6((5)'s| < K. In this case, 3(6)' = {xl^ix\<S}^^) is of finite variation, 
and so is (a;l|[2,|<5|l$) Therefore, we have the decomposition 



(58) 



X{6y = Xo + X + XiS)"" + X{df - B{6), where 

^.^xa ^ [ (^1{|^I<'5}1*)*/^' if /?'<!, 

\{xl{\x\<5}U)*{f^-i^), if /?'>!, 

X{6f = (xl{|^|<5}l<i,c) * (/X - u), 



X 



B + X"^ 
B + X^ 



B{5)\ 



if 13' < 1, 
if /3' > 1. 



Lemma 3. We have for all 6 £ {0,1], p> 2, s,t> 0; 



(59) 



(Ks, 

(a) \B{S)t+s - B{5)t\ < \ Kslog{l/S), 

(b) ¥.t{\Xt+s-Xtr)<KpsV/\ 

(c) M\X{5)\+s-X{5ft?)<Ks5^-^ 

Ud) M\X{6)t^s-X{5)'if)<Ks. 



if(3<l, 



Proof, (a) follows from (47), and (b) follows from Burkholder-Davis- 
Gundy inequality and Assumption 5. By (47), again, we have 



U\X{6)l,-Xi6t\')=Et^l 'dr j^^ 

which is (c). For (d), we single out the two cases /?' < 1 and (3' > 1. In the 
first ca! 
we get 



>{\x\<5} 



first case, and since for any sequence {xm) we have | Yl,m^rn\^' — l^^ml^'j 



<eJ( j2 \Axi6):.A^] 

\ \t<r<t+s ) ) 



dr 



{\x\<&} 



xf\^{r,x)F'y(dx) ) <Ks. 
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In the second case, we apply Burkholder-Davis-Gundy inequality with the 
exponent 1 < /?' < 2 to get 



E{\X{6)^^,-X{6) 



\ \t<r<t+s / 
\t<r<t+s ) 



which, exactly as before, is smaller than Ks^ and (d) is proved. □ 

Next, we give a general result on counting processes. Let be a counting 
process (i.e., right continuous with A'^o = 0, piecewise constant, with jumps 
equal to 1) adapted to {Ti) and with predictable compensator of the form 
At = /o Ads. 

Lemma 4. With N and A as above, assume further that Xt <u for some 
constant u> 0. Then, we have 

(60) \¥tiNt+s -Nt = l)- EtiAt+s -At)\+ ^tiNt+s -Nt>2)< {us)\ 

Proof. Let Ti,T2,... be the successive jump times of A^ after time 
(i — 1)A„. We have 

FtiNt+s -Nt>l)< Et{Nt+s - Nt) = Et{At+s - At) < us, 
Ft{Nt+s -Nt = l)= Et{N^t+,^^T, - Nt) 

= Et{A^t+s)An-^t), 
Et{At+s - At) - Ft{Nt+s -Nt = l)= Et{At+s - \t+s)An) 

<us¥t{Nt+s-Nt>l) < ius)\ 

This gives us the first estimate. Next, 

Ft{Nt+s -Nt>2) 

= Ft{T2<t + s) = Et{l^T,<t+s}nT2 <t + s\TT,)) 

»(i+s)AT2 



■■Et[l{T,<t+s]^^j^ Kdr^T, 



Ti 

<us¥t{Nt+s-Nt>l)<{us)\ 
hence the second estimate. □ 
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Lemma 5. With the notation N{6)t = J2s<t ^{\AXs\>d}> /^^ '^^^ ^ ^ (0, 1]; 
C £ (0, ^) and p>2 we have 

FtiNi5)t+s - N{6)t > 1, \Xi5)[^, - Xi5)[\ > SQ 

(61) 

Proof. (59) and Bienayme-Tchebycheff inequality yield 

Xt+s - Xt\ > — 1 < K„^^. 
t+s ti 3 y - p^pSp 

Next, (59) yields \B{5)t+s — B{6)t\ < Kqs/6 for some constant Kq, so we 
have either \B{5)t+s - B{6)t\ < 6C/3 or KsPl'^j^'b^ > 1 for some constant K 
independent of (" and 5. Then, it remains to prove that, with the notation 
M{6) = (xl{|3.|<5}) ★ (/i - i^), 

(62) Ft(N{6)t+s - N{S)t > 1, \M{6)t+s - M{6)t\ >^)<K 



For simplicity, write Ns = N{5)t+s - N{8)t and Ms = M{5)t+s - M{6)t. 
By Bienayme-Tchebycheff inequality, the left-side of (62) is not bigger than 
9E,t{NsM^)/6'^C'^ . Now, is a counting process and M is a purely discon- 
tinuous square-integrable martingale, and they have no common jumps, so 
Ito's formula yields 



N. 



Ml = 2 r Nr^Mr- dMr + f dNr + V iVr- (AM, 
^0 Jo 



Moreover, the compensator N is as in the previous lemma, with As < K6~^ , 
and the predictable quadratic variation of M is = Jq dr with A'^ < 
K6'^~^ by (47). Then, taking expectations in the above display, and since 
the first term of the right side above is a martingale, we get 

EtiNs Mf ) = Et (^^' dAr + Nr dK^j 

< K6~^ r Et{Ml + 6^Nr) dr 
Jo 

= K5-^ r EtiK + S^K) dr < K6^^'-fh\ 
Jo 

(62) then follows. □ 

Lemma 6. Let a> 0, G (0, ^) and i] G (0, ^ — w), and set 

(63) p = 7? A {w{[3 - (3') - (3'ri) A {lu-f) A (1 - - 2r]). 
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There is a constant K depending on [a^w^rj), and also on the characteristics 
of X , such that for all s G (0, A„] and t>0 we have 

(64) Ft{\Xt+s-Xt\>aA^)-Et^'^'FriaA^)dr^^ 



(65) Pt(aA- < \Xt+s -Xt\< aA-(l + A;^)) < KAi~^^+P, 

(66) Ft{\Xt+s - Xt\ > aA^) < KAi— ^. 

Proof. It is clearly enough to prove the estimates for all A„ small 
enough. We write 6n = ctA'^ . 

(1) Apply (59) and Bienayme-Tchebycheff inequality to obtain 

¥t{\Xt+s - Xt\ > aA-+73) < i^pA^^(i-2--2'')/2, 

FtmSn)^+s-XiSn)1\ > aA-+73) < KAi-^P'-^P'. 

Moreover, for all values of /3, we have |i?((5„)t+s — i?((5„)j| <KAn^^^ x 
slog(l/s), which is smaller than aAj^+^/S as soon as A„ is small enough 
because t] <1 — vj(3. Since 1 — 2w — 2r/ > 0, by choosing p large enough, we 
deduce from the previous estimates that, for all A„ small enough, 

¥t{\X{5n)'t+s + X{5n)\+s - {X{5n)'t + X{6n)\)\ > aA^T^") 

(67) 

— n 

(2) By Assumption 6, we have F^{dx) < {L' /\x\^^^) dx for some con- 
stant L' . We fix n. For each (j S we endow the canonical (Skorokhod) 
space (J-'D) of all cadlag functions on M+ starting from with the 
(unique) probability measure Q2, under which the canonical process X' is 
a semimartingale with characteristics (0,0, ^), where 

(68) i^s^^^^{dr,dx) = drl{\^\<s„}(^j^YT^ 

This measure does not depend on uj'. Hence, under the process X' 
has independent increments, c^idr, dx) depends measurably on oo. Hence, 
Q'2j{duj') is a transition probability from (O,^) into (0',^'). Then, we ex- 
tend X, X' and other quantities defined on or Vt' in the usual way (with- 
out changing the symbols) to the product = x f]' endowed with the 
product (T- field JT, the product filtration [!Ft) and the probability measure 
¥n{dLO,duj') = F{duo)Ql{dio'). 

Because of (68) and (47), and as in Lemma 3, EQn(|X^'^^ — X[\'^ \ T[) < 
KsSl'l^. Then, we see that, for some constant C depending on a and j3 but 
not on n, i and we have 

(69) QZ{\X[+, - X[\ > aA-+^ I T[) < CA^-^-2^. 



36 Y. AIT-SAHALIA AND J. JACOB 

(3) By well known results on extensions of spaces [see, e.g., Jacod and 
Shiryaev (2003), Section II. 7; note that the present extension of the original 
space is a "very good extension"], X' is a semimartingale on the extension 
with characteristics (0,0, f^^), where v'^^{{uj,u}')^dr,dx) = u'^^ ^{dr,dx) and 
any semimartingale on the original space is a semimartingale on the exten- 
sion with the same characteristics. Moreover, X and X' have almost surely 
no common jump, so the sum y (5„)' = X{6n)'' + X' is a semimartingale with 
characteristics (0,0, i^^'^), where 

v'sMr^dx) = drl{\^\^Sr,}K{dx) + u's^{dr,dx) = l{|x|<5„} drdx, 

where the last equality comes from (68). It follows that Y{5n)' is a Levy 
process with Levy measure given above, or in other words it is a version 
of the process Y{5n)' of (48) with A = L' . Hence, we deduce from (49) and 
from the Levy property of Y{5n)' that, for any A S 

(70) P„(^n{|y(Ot+.-m);i>aA-(l-2A:i)})<KAy3-4-/3/3ip(^). 
Next, for all A„ small enough, so that CA^~™'^~^'' < 1/2, we can write 

P(A n {\Y{5jt+, - Y{6M > aA-(l - 2A;1)}) 

> P(A n {\X{Sn)1+s - X{6n)1\ > «A-(1 - Al)} 

n{|x;+,-x;i<aA-+n) 

> ip(.4 n {\X{5n)Us - I > «A-(1 - Al)}), 

where the last inequality comes from (69). Then, by (70) and by the fact 
that A is arbitrary in ^(j_i)A„) we deduce (since necessarily p < ^—^) that 

mX{6n)U,-X{dn)1\ > aA-(l - A;^)) < KA:^/3-4-/3/3 < i^A^^ ^+''. 

In turn, combining this with (67), we readily obtain that, for all A„ small 
enough, 

¥ti\XiSn)Us-XiSnyt\>aA^) 

(71) < K A^-"^( Ay3-^/3/3 + ^^(/3-/3')-2,,) 

(4) Now, we write 5^ = aA^(l + A^^) and also 

9t,s = Et F,(<5„) dr^ , 9i^, = Et (^j'^^^FM dr^ , 
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and the following two counting process 

= 1{|AX,|>5„}, = H 1{|AX,|><5;}- 

s<t s<t 

Their predictable compensators are Jq Fr{5n) dr and Jq Fr{S'n) dr, whereas 
both Fr{6n) and Fr{6[J are smaller than i^T/A^^. Hence, (60) gives 

PtiK+s - = 1) - ^t,s\ + iPi(ivr+. - ivr > 2) < kaI^^-^^\ 



't{Nii, - = 1) - < kaT'^'^'^. 



(72) 

|Ft(iV 

Since N"' — N'" is nondecreasing, we have 

¥t{N^^,-N^ = l,Ni%-Nl^ = 0) 

= ¥tiN,% - iV* = 1) - FtiN^l, - Nj = 1) 
+ (iV,% - K > 2, iV/!^, - = 1). 

Then, (72) yields 

(73) |Pi(iVf+, - iVf = 1, N^l, - Nr = 0) - {Ot^s - < i^A^d— 
Moreover, (47) clearly implies ^t,, - e^^, < i^Ai-^^^^^j, ^ ^^(7A(/3-/3'))) < 

We then deduce from (73) that 

(74) Pt {N"^^^ -K = l, Nil, - iV/" = 0) < KAl^^f^+P. 

(5) If Afj"^, - = Nl% - Aff = 1 and \Xt+s -Xt\<6n, then, necessar- 
ily, iXiSnYt+s - Xi^nYtl > aA-+''. Hence, 

Pt(iV,'V, - iVf = 1, \Xt+s - Xt\ < aA-) 
< Pt (iV-^, - = 1, Nl^s - Nt = 0) 

+ Pt(iV,% - = 1, iXiSnYt+s - > aA^+^). 

Then, if we apply (61) with p large enough and 5 = 5n and Q = A^, together 
with (74), we deduce that, as soon as A„ is small enough, 

(75) Pi(iVf+, - = 1, \Xt+, -Xt\< aA^) < KAI~^^+p. 
Finally, Xt+s -Xt = X(5„)J+, - X(5„)J on the set {Np_^, - iVf = 0}, so 

Fti\Xt+s -Xt\> aA-) = Pt(iV,% - iVf = 1) 

- Pt(iVr+, - = 1, \Xt+s -Xt\< aA-) 

+ FtiK^, - iVf = 0, iXidnYt+s - > aA-) 

+ FtiiK+s - K > 2, |Xt+, - X^l > aA-). 

Then, if we combine (71), (72) and (75), we get that for all A„ small enough, 
we readily obtain (64). We also trivially deduce (66) from (47) and (64). 
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(6) Finally, a close look at the previous argument shows that (64) also 
holds with aA^(l + A^) and Ot^si' in place of aA^ and 9t^s- Therefore, (65) 
follows upon using the property 0t,s — 0^ ^ < KAj^^'^f^'^f proved above. □ 

Lemma 7. Under the assumption and with the notation of Lemma 6, 
and if M is a bounded martingale, we have (with K depending also on M, 
recall s < An) 

(76) < KAi-^f^+P + KAi~'^^+"^(^Et{\Mt+s - Mt\) 



+ KA(,i-(-+'')^)/VlEt(|Mj+, - MiP). 



Proof. (1) There exist C functions /„ such that 

1{|x|>qA-(1+2A;1/3)} < fnix) < '^{\x\>aA^ (l+A'l/S)} 
^^^^ ^\f'n{x)\<^, I/:(X)|<^^ 



With X' = X — B — X'^, and since M is bounded, we have 

(78) < KFtiaA^ < \Xt+s - Xt\ < aA^(l + A;^)) 

+ KKt{\fn{Xt+,-Xt) - /„(!;+, -xoi). 

Now, we have 

\fnix + y)- fnix)\ < l{|y|>QA^+''/3)} + * 1 2^1 1{qA-<|z+?/| <aA-(l+A;i)} ■ 

If we apply this with x = X^j^, - X[ and y = (B + X^)f+, - (B + X'=)t, 
plus (59) (b) for p large enough and Bienayme-Tchebycheff inequality and 
1 — 2w — 2?7 > 0, plus (65) and (59)(b) again and Holder's inequality, we 
obtain that the right side of (78) is smaller than A'A^~^^"^^. Therefore, it 
remains to prove that 

\Etmt+s-Mt)fniX[^s-X't))\ 

(79) < KAl,~^^+P + KA^-(^+'')^Ej(|Mt+, - Mt\) 



+ KA(,i-("+'')^)/%/Et(|Mt+, - Mt|2). 



(2) According to Theorem III.4.20 of Jacod and Shiryaev (2003), we can 
"project" the martingale M onto the random measure /i, which amounts 



(80) 
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to decomposing it as the sum of two martingales M = M' + M" with the 
fohowing properties: 

M' = 6-k [fj, — v) for some predictable function 6, 
J2s<t'Pi-^^ ^■^s)^M'J is a martingale as soon as 
(p is a predictable function satisfying \(p{u!,s,x)\ < K{1 A |x|), 
the difference of the predictable quadratic variations 
{M,M) - {M',M') is nondecreasing. 

Note that we may even choose 5 bounded, because M is bounded [the fact 
that all the above processes are martingales and not only local martingales 
comes from the boundedness of M and of /(x^ A 1) Ft{dx)]. We will also use 
the following consequence of the third property above: 

Et(^j'^'du J Ft+u{x)5{t + u,xf 

(81) =Et((M',M')t+.-(M',M': 

<^t{{Mt+s-Mtf). 

With t being fixed below, for simplicity, we write = Xj-j^^ — X[. Since M 
is a bounded martingale and Y a semimartingale with vanishing continuous 
martingale part, and fn{Y) is bounded, we deduce, from Ito's formula and 
the properties (80), that the product (Mj+r — Mt)fn(Yr) is the sum of a 
martingale plus the process Jq^'^cIu, where 



-f'^ = J Ft+u{dx){{Mt+u - Mt)gn{Yu,x) + 6{t + u,x)hn{Yu,x)), 

where 

hn{y,x) = fn{y + x) - fn{y), gn{y,x) = hn{x,y) - fli{y)xl^\^\<i}. 
An easy computation allows us to deduce of (77) that 

\hn{y,x)\ <K^y^, 

\gn{y,x)\ < +i^l{aA-<|y|<aA-(l+A:j)} 

|a;|Al 

:lri„.i^„,A^+';i H . ^_L^ lri„i^„,A='+»n 



Now, we apply the first estimate of (47) with x = aA^"*"^ and the second 
and third ones with u = aA^"*"^ plus Cauchy-Schwarz inequality, to get, for 
any e > (recall that 6 is bounded), 

+ \Mt+u - Mil A-(^+^)(-+'')l|„A^<iy„|<aAjr(i+A„,)}. 
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Since 77 < 1/2 — tz7, we have + 77) < 1 and, thus, {13 + e){w + rj) = 1 
for a suitable e > 0. Moreover, Ej(|Z„|) < E((|Zjj|) if u < s, because Z is a 
martingale. Therefore, since M is bounded and s < A„, we get 

|Et((Mi+,-Mi)/n(x;+,-x;))i 



Yrdr 



Jo 

+ a(,i-(-+'')^)/2 (Et du J Ft+u{x)6{t + u, xf 



1/2 



+ KA-^ Pt(aA^ < \Yr\ < aA^(l + A„7?)) dr, 
Jo 

where we have used Cauchy-Schwarz inequality. By (65), for the process X' 
instead of X and by (81), we readily deduce (79). □ 

8.3. Some auxiliary limit theorems. Below, recall the process A of (11). 
We still assume Assumptions 5 and 6 and also m £ (0, ^) and a > 0. 

Lemma 8. Let p' = ^ A {vo{(5 - (/? - 7) V [5')). Then, for all t > 0, the 
sequence 

lt/A„] .^^^ 

Y: A-%ILi / FtiaA^)dt 

fr[ \J{i-i)A„ 



(82) 
is tight. 



A~P 



n>l 



Proof. Let 6^ = Slt\)l^^ Ft{aA^) dt and 7?f = jlt\)^^ M dt. We de- 
duce from (47) that 



\mfinn 
n i 



.fs'li 



<_^^l+t^7{/3-(/3-7)V/3')_ 



Then, obviously. 



/ [*/An] 
\ i=l 



A^^r ■ 



<Kt, 



and, since At is bounded, we have \At — X]f=f vTl ^ KtA^, whereas p' < I. 
It thus remains to prove that 



(83) 



the sequence A~ 



[i/An] 

i=l 



is tight. 



n>l 
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Since = ^n^' iv? ~ ^i-ii'Hi')) is a martingale increment, for (83), it is 
enough to check that a„(t) = E(X]|=f (C")^) is bounded. However, since 
At is bounded, we have KfP < KA^-'^p', so a.„(t) < KtAl~^P' < K because 
p'<l/2. □ 



Lemma 9. (a) Let 
(84) x' 



, l-ro/? zu(p-P') l-2w 
w-f) A A ^, ' A . 

Then, for all e > and all t> the sequence of variables 

[i/An] 



(85) 



E IPr-i(|Ar^|>aA-)-^ 



a' 



is tight, and, in particular, we have 

[t/Ar. 

(86) 



E Pr-i(|Ar^|>aA-) 



i=l 



a 



/3' 



(b) 7/ further < fl^^c^ 7 > f '^'^^ ^ < ^ if M is a 

hounded continuous martingale, we also have 



(87) 



A^r^ E IPr-i(|A?X|>aA-)--| 



i=l 

[i/An] 



0, 



^ |E^_i(ArMl||A.^|>„^^})|^0. 



i=l 



Proof, (a) Let G (0, ^ — ro) and p given by (63) and p' like in Lemma 8. 
From (64) and (82) we deduce the tightness of the sequence (85), provided 
we substitute 'x' — e with p' t\ p. It is thus enough to show that we can 



choose 77 < ^ 2 ^ such that A p = %' — e, and this is achieved by taking 



_ 1-^/3 . ^{13-13') . 1- 
'/ — 3 /\ i^f^, /\ 



simple computation shows. 

(b) In view of (a), (87) follows from the property x' > 'cu(3/2, which is 
an easy consequence of the assumptions. 

It remains to prove (88). By (76), the left-hand side of this expression is 
smaller than 

[i/An]+l 

irt ^ J2 E7_i(|A7M|) 

i=l 

[t/AnH 

+ KA^^-^f^/^ E 

i=l 



Ef_,{\AfM\). 
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By the Cauchy-Schwarz inequality, this is smaller than 

/ /[t/A„]+l X l/2x 

A well known property of martingales yields 

Ei ]E?_i(|A?M|2)j=E((MA„([t/A„]+i-Mo)'), 

which is bounded (in n). Therefore, we deduce that (88) holds, provided we 
have p > w(3/2 and also 1 — 2^7/3 > w(3. The first condition has already been 
checked, and the second is satisfied because ??<(! — w(3)/2>. This ends the 
proof. □ 

8.4. Law of large numbers and central limit theorems. Here, again, we 
assume Assumptions 5 and 6, and we have a > and w S (0, 1/2). 



Proposition 1. Ifx^s given by (19), then for each t>0 and each e> 
the sequence 

At 



n>l 



is tight, and, in particular, 

(90) A-%(n7,a)r^4- 
Proof. Set 

(91) Cr = A^^/2(l{|A^x|>aAjf}-Pr-i(|ArX| >aA-)). 

By virtue of Lemma 9, and since x = x' ^ (^/?/2), it suffices to prove that 
the sequence X]f=^"' Ci' is tight. Since the Cj^'s are martingale increments, 
it is enough to show that the sequence a„, (t) = Efif IE((Cf )2) is bounded. 
But (66) yields < KA^, which in turn yields a„(t) < Kt. □ 

2+p and-f>^ 



Proposition 2. Let a' > a. If we have f3' < and 7 > f and zu < 



2^ A the pair of processes 

(92) A--P/'^A^^U{w,a)^ - ^, A^^U{w,a')^ - 

converges stably in law to a continuous Gaussian martingale {W, W') inde- 
pendent of J-', with 

(93) mt)=§, ^(^'")=^^ nWtW[)=^. 



ESTIMATING JUMP ACTIVITY 



43 



Proof. Define Cj" by (91) and associate Ci" with oi in the same way. 
The variables Cf and ^j'" are martingale increments and smaller than K/S!^^^'^ . 
So, in view of (87), it is enough, by using a criterion for stable convergence of 
triangular arrays found in Jacod and Shiryaev (2003) (see Theorem IX. 7. 28) 
to prove the following: 



(94) 



[i/An] 

E 

i=l 

[t/An] 



iEr_i((c 



n\2\ 



i=l 

E 

i=l 



lE^i(crcr) 



(95) 



[i/An] 

i=l 

[t/An] 

Eti(CArM) 



E 

i=l 



A 
A 

^0, 

F 



where M is any bounded martingale. 

Recalling a < a' and also Pf_i(|A^X| > aA^) < J^A^-^^^ by (66), we 
deduce (94) from (86). As for (95), and since Ef_i(AfM) = 0, it readily 
follows from (88). □ 



8.5. Proofs of the main theorems. 



Proofs of Theorems 1 and 2. As said before, we can assume As- 
sumptions 5 and 6. We will write r/ = x — ^ if we want to prove the first the- 
orem, in which case we choose e small enough to have r/ > 0, and rj = wf3/2 
if we want to prove the second one. Then, we set 

Vn = A-''(A-%(tz7, a)r - At/a^), = A-\A^Pu{zu, a')f - A/a'^), 



so that on the set {At > 0} we have 
(96) A-'^0r,{t,w,a,a')-P) 



A"'' 



■log 



l + a/^A^Vn/At 



log(aVa) ^l + a'f'AIVJ./At' 

In all cases, the two sequences (V^) and (V^) are tight, so the right-hand 
side is equivalent (in probability) to the variable {a^Vn — a'^V^)/{At log(a'/a)) 
on the set {At > 0}. At this stage. Theorems 1 and 2 readily follow from 
Lemma 9(a) and Proposition 2, respectively. □ 
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Proof of Theorem 3. This is a trivial consequence of the properties 
of the stable convergence in law and of Theorem 2. □ 

Proofs of Theorems 4 and 5. In the situation of these two theorems, 
we have At = At> 0, so we can apply Theorems 1 and 3 with {Af > 0} = 0. 
Indeed, we have Assumptions 1 and 2 with /3' = and 7 arbitrary large. 
So, the only differences are on the conditions on w, namely w <1/P for 
the consistency and zu < 2/(3/3) for the CLT in the first theorem, and w < 
1/(2 + /3) for the CLT in the second one, instead of < 1/2 in Theorem 1 
and uj < 2^ A ^ in Theorem 3. 

In fact, the improvement lies in Lemma 6. When X = Y we have the 
trivial estimate, coming from (24) and from the scaling property of Y, 

As 

(97) Ft{\Xt+s -Xt\>x)= F{\Xs\ >x) = -, + 0{s^/x^f), 

regardless of the value of x > 0. So, we have all claims of Lemma 6 with any 
?7 S (0, 1 — vuP] and with p = 1 — w(3, and, thus, (a) of Lemma 9 holds for 
some X > as soon as zu <l/f3, whereas (b) of that lemma holds if further 
p > -07/3/2, that is, when w <2/ (3/?). 

When Xt = crWt + It, we obtain for any 77 G (0, 1/2 — zu) and s < A„ 

Ft{\Xt+s - Xt\ > aA-) = F{\Xs\ > aA^) = ^ + 0(A^-i-^+''), 

[by applying the estimate (97) for Y and the fact that E{\X^\p) = KpsP/"^ 
and Bienayme-Tchebycheff inequality]. Then, we have all claims of Lemma 
6 with any r/ G (0, (^ — to) A (1 — to/?)) and with p = rj, and, thus, (b) of 
Lemma 9 holds as soon as we can find rj as above, with p = r] > This is 
possible, provided to < A j^. The rest of the proofs of Theorems 1 and 
3 goes through, and we are finished. □ 
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